Binary Mask Estimation for Improved Speech Intelligibility in Reverberant Environments

نویسندگان

  • Oldooz Hazrati
  • Jaewook Lee
  • Philipos C. Loizou
چکیده

A blind (non-ideal) time-frequency (T-F) masking technique is proposed for suppressing reverberation. A binary mask is estimated at each T-F unit by extracting a single variance-based feature from the reverberant signal and comparing its value against an adaptive threshold. The performance of the estimated binary mask is evaluated using intelligibility listening tests with hearing impaired listeners in four moderate to highly reverberant conditions. Results indicated that the proposed T-F masking technique yielded significant improvements in intelligibility even in highly reverberant conditions (T60 = 1.0 s). This improvement was attributed to the recovery of the vowel/consonant boundaries which are severely smeared in reverberation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ideal Ratio Mask Estimation Using Deep Neural Networks for Monaural Speech Segregation in Noisy Reverberant Conditions

Monaural speech segregation is an important problem in robust speech processing and has been formulated as a supervised learning problem. In supervised learning methods, the ideal binary mask (IBM) is usually used as the target because of its simplicity and large speech intelligibility gains. Recently, the ideal ratio mask (IRM) has been found to improve the speech quality over the IBM. However...

متن کامل

Classification based binaural dereverberation

Reverberation has a detrimental effect on speech perception both in terms of quality as well as intelligibility, as late reflections smear temporal and spectral cues. The ideal binary mask, which is an established computational approach to sound separation, was recently extended to remove reverberation. Experiments with both normal hearing and hearing impaired listeners have shown significant i...

متن کامل

Effect of the division between early and late reflections on intelligibility of ideal binary-masked speech.

The ideal binary mask (IBM) that was originally defined in anechoic conditions has been found to yield substantial improvements in speech intelligibility in noise. The IBM has recently been extended to reverberant conditions where the direct sound and early reflections of target speech are regarded as the desired signal. It is of great interest to know how the division between early and late re...

متن کامل

Decreasing speaking-rate with steady-state suppression to improve speech intelligibility in reverberant environments

1. Introduction It is known that strong reverberation affects speech intelligibility. Although early reflections often help speech intelligibility (the Haas effect, e.g., [1]) late reflections degrade speech intelligibility [2]. Overlap-masking in rever-berant environments is the main source of degradation in speech intelligibility [3–5]. Because of overlap-masking, reverberant components of pr...

متن کامل

A procedure for testing speech intelligibility in a virtual listening environment.

OBJECTIVE The development of a test of virtual speech intelligibility in noise that enables assessment in typical, everyday listening situations. To eliminate extraneous confounding factors, digital signal processing was incorporated to simulate listening environments and source locations and allow presentation of stimuli via earphones. DESIGN Source-to-eardrum transfer functions measured on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012